04:00
2026-06-24
arxiv.org
ai-safety
REALM: A Unified Red-Teaming Benchmark for Physical-World VLMs
Researchers introduced REALM, the first unified red-teaming benchmark for physical-world vision-language models (VLMs), integrating 12 attack methods, 3 defenses, and 13 VLMs under a black-box threat โฆ